ABSTRACT

(U) A general survey is made of the data collected by the Village
Information System, Thailand project. Principal component analysis and
principal factor analysis data reduction techniques are applied to the
data for selected areas in northeast Thailand and the results are
compared. Algebraic models are applied to a selected variable of the data
and forecasting techniques applied to each model to predict the value 
of the variable in the next time period. Conclusions are presented con- 
cerning the operational usefulness of the analytical techniques applied 
to the data. 

I. INTRODUCTION (U)



A. PURPOSE (U) 

(U) This thesis continued the study of insurgency incident reports 
from Thailand which was begun in a thesis by Major P. E. Gardner, USMC
[1]. Gardner described and documented the development of the insurgency 
in Thailand, within the security classification level of this document, 
and the establishment of the Village Information System, Thailand 
project to collect and process data concerning the insurgency. That 
information is available and will not be repeated here. 

B. SCOPE OF ANALYSIS (U) 

(U) An expanded data base, containing approximately four times the 
initial number of incident reports, was available for this thesis. This 
data base was surveyed with respect to the information it purported to 
contain, information it was believed it should contain, the manner of 
presentation of the collected information, and the observed information 
in the incident report files themselves. These areas were evaluated and
critiqued, and recommendations were made for changes or additions which
the author believed might improve the usefulness of the data base. It was
fully realized that such a survey and recommendations could only be made
in a very general sense since the author was several thousand miles
away, had no personal experience concerning the insurgency or
counterinsurgency activities in Thailand, and had only the data tape and
a few after-action and trip reports relating directly to the data base
for the thesis.

(U) The factor analysis technique applied by Gardner was applied
to the expanded data for the same areas, and the resulting factor 
patterns compared with those obtained by Gardner. Two different tech- 
niques of factor analysis were then applied to a new major geographical 
area and its major subdivisions. The two factor analysis techniques 
were compared for the different size data sets and also using sets of 
generated data with known underlying probability distributions for the 
variables under consideration. 

(U) Prediction of levels of insurgent activity based on the infor- 
mation contained in the data was considered. Criteria for predictions 
and forecasting techniques in a counterinsurgency environment were 
developed, and three algebraic forecasting models were presented, applied 
to six geographical areas representative of different patterns of insur- 
gency, and evaluated for applicability and usefulness as an aid in 
conducting a counterinsurgency program. 

(U) It will be noted that there are some variations in the number of
incident report files from a given area in different phases of the
analysis effort. This was a result of the elimination, either manually
or by the computer, of some incident report files counted in the total
of 6264 reports constituting the data base. The eliminated files were
those in which information was missing in the variables necessary for a 
particular analysis. 

C. CLASSIFICATION (U)

(U) The overall security classification of this thesis is CONFIDENTIAL,
Not Releasable to Foreign Nationals. The information on the data tapes
is UNCLASSIFIED as recorded, but the description of an incident report
in terms of the variable code definitions is CONFIDENTIAL.
As the material is concerned with the collection and evaluation of 
intelligence information, it was placed in Group-3 in accordance with 
Department of Defense Directive 5200.10. 



II. VILLAGE INFORMATION SYSTEM, THAILAND (U) 



(U) The background and development of the Village Information 
System, Thailand (VIST) project has been described and documented by 
Gardner [1], and no further information in these areas was received. 

A second magnetic tape of VIST incident data (VIST Task 2) was made
available by the Stanford Research Institute subsequent to the completion
of the analysis by Gardner, and it was this second, expanded-base tape
which provided the data set for the analysis conducted in the
preparation of this thesis.

A. EXPANDED VIST DATA SET (U) 

(U) The data set used contains 6264 incident files, compared to 1592
in the initial set, for a 33 month period from January 1967 through
September 1969. The 80 column format for an incident file is shown in
Appendix A. In addition to providing coverage for an enlarged time
span, the expanded data set also increased the area coverage from six
Changwats to seven and added many Amphoes to the initial six Changwats.
The expanded list of geographical areas and their codes is shown in
Appendix B.

B. GENERAL ANALYSIS OF VIST DATA (U)



(U) Before any attempt was made to apply statistical or mathematical 
techniques of analysis to the data set, a general critical survey of the 
data was conducted so that a basic evaluation of the data set could be 
made. This survey considered both the overall aspects of the incident 
data reporting and recording system within the VIST project with respect 
to the 80 column coded representation of a report, and the specific 
elements of reported information contained within the data set. 

1. The Incident Reporting System 

(C) The failure of the VIST project to proceed into Task 3,
the reporting and recording of Royal Thai Government (RTG) forces data,
seemed to be the greatest weakness of the VIST project from the point of
view of usefulness of the coded data. Regardless of the amount of
analysis performed or the analysis techniques applied, the results would
still have to be interpreted as only a one-sided picture of Communist
Terrorist activity. In the absence of information concerning RTG
counterinsurgency forces and operations, both military and civic action,
there was no way to interpret significant underlying cause-effect
relationships which might be indicated by the results of data analysis.
An increase in insurgent activity followed by a sharp decrease in a given
area could indicate aggressive and successful RTG counterinsurgency
operations or a virtual ceding of the area to Communist Terrorist control.
In the same manner, periodic cycles of varying amplitude could be
indicative of the effectiveness of RTG counterinsurgency programs in
retaining control of an area against a persistent Communist Terrorist
offensive, with the cycles being caused by the time lag in resupplying
personnel and equipment to the insurgent organization from outside
Thailand following an unanticipated insurgent program setback; or they
could reflect the effects of local weather on insurgent mobility. In this
regard it was noted that the local weather was not an item of information
reported.



(C) The second major element of the reporting system considered was 
that the data information contained in an incident file represented a 
report received under an incident code and not a single, exclusive 
representation of an event. The incident codes are listed and defined 
in Appendix C. After several series of successive reports were reviewed 
from different geographic areas the following hypothetical event was 
constructed: 



A Communist Terrorist courier was sighted by a police patrol boat 
while crossing the Mekong from Laos. The courier was apprehended
the next day approximately 15 kilometers from the border crossing
point while trying to avoid a checkpoint set up by the Regional 
Force in response to the police alert. The courier was carrying 
mail addressed by Vietnamese name only and an operational 
directive for a planned increase in armed propaganda and terrorism. 
After being transferred to an RTA Interrogation Center, the courier 
revealed the location of two base camps he was to service in the 
mountains and the general location of his point of departure, a 
headquarters and training center, located in Laos. 

The inflation factor was apparent, since a single event of this kind
could enter the data set as several separate incident reports, but it
was also apparent that the majority of the incident codes were not
readily subject to multiple incident reports. The inflation factor was
considered significant, however, based on the relatively small numbers
of incidents occurring per month in most amphoe size areas, particularly
with respect to total incident level forecasting. One method of reducing
multiple reporting of one event or instance of Communist Terrorist
activity would be the assignment of an identification "tag" to an event,
such as a firefight, where this tag would be an element of all reports
resulting from the examination of documents or the interrogation of
prisoners obtained as a result of the encounter. Another approach would
be the review of the incident reporting codes to eliminate those items
which are essentially of intelligence interest only and do not
constitute events in space or time. Rarely, if ever, is the capture of a
document an isolated event or of significance in itself; the
significance, if any, stems from the information contained within the
document which can be exploited by the RTG in its counterinsurgency
program.

(C) Within the format of the data elements coded (Appendix A), the
categories of incident reporter, incident initiator, and incident target
require a total of 16 columns of 0 or 1 entries. Since each of these
categories is partitioned, numerical coding of the variables could
represent the same information in three columns.
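
As a rough illustration of this recoding, the sketch below collapses the 16
zero-one columns into three numeric codes. The grouping into six reporter,
five initiator, and five target columns follows Variables 2-7, 8-12, and
13-17 as discussed in this chapter; the function name and the use of 0 for
"no entry recorded" are assumptions made for the example and are not part
of the VIST format.

```python
# Hypothetical recoding of the 16 zero-one columns into three numeric codes.
# Group sizes follow Variables 2-7 (reporter), 8-12 (initiator), 13-17 (target).
GROUP_SIZES = (6, 5, 5)


def recode(flags):
    """Return one numeric code per category; 0 means no entry was recorded."""
    if len(flags) != sum(GROUP_SIZES):
        raise ValueError("expected 16 zero-one entries")
    codes = []
    start = 0
    for size in GROUP_SIZES:
        group = flags[start:start + size]
        if sum(group) > 1:
            raise ValueError("entries within a category must be mutually exclusive")
        # 1-based position of the single 1 within the group, or 0 if none.
        codes.append(group.index(1) + 1 if 1 in group else 0)
        start += size
    return tuple(codes)


# Illustrative record: second reporter type, fourth initiator type, no target.
example = [0, 1, 0, 0, 0, 0,   0, 0, 0, 1, 0,   0, 0, 0, 0, 0]
print(recode(example))
```

A code of 0 keeps the "no information recorded" case distinct from the
listed categories, which matters for the initiator and target categories
where most files carry no entry.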

2. The Reported Information (U) 

(U) It was discovered that a useful aid in an overall analysis 
of the information content of the VIST data was a by-product of computer 
analysis routines performed on the data, namely, a listing of the means 
and standard deviations of the variables under consideration. The 
listing for the total data set, 6264 incident report files, is shown in 
Table I. 

TABLE I

OBSERVED DATA PARAMETERS

VARIABLE         MEAN       ST. DEV.

    1       26.131195      18.011444
    2        0.171372       0.376819
    3        0.274179       0.446067
    4        0.070362       0.255772
    5        0.108319       0.310793
    6        0.147906       0.354994
    7        0.217509       0.412522
    8        0.048523       0.214827
    9        0.013565       0.115676
   10        0.028411       0.166154
   11        0.172968       0.378213
   12        0.004628       0.067870
   13        0.078781       0.269410
   14        0.076390       0.265678
   15        0.038309       0.191950
   16        0.052996       0.223996
   17        0.001117       0.033409
   18       14.444157       9.418326
   19        5.786589       3.274970
   20        0.468303       0.498937
   21        0.042939       0.202728
   22        0.109432       0.312135
   23        0.024739       0.155335
   24        0.492889       0.499886
   25       15.070463      41.106903
   26        0.678805       9.149450
   27        8.531009       8.106230



(U) The analysis effort was placed primarily on the means of the
variables since no specific interpretation or significance could be
assigned to the variance of a dichotomous, zero-one, variable.
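
One way to see why the variance of a zero-one variable carries no separate
information is that it is fixed by the mean: a zero-one variable with mean
p has standard deviation sqrt(p(1 - p)). The sketch below checks this
against three rows copied from Table I. Small differences are to be
expected from rounding and from whichever divisor the original program
used, which is not known here.

```python
import math

# (variable number, mean, standard deviation) copied from Table I
rows = [
    (2, 0.171372, 0.376819),
    (17, 0.001117, 0.033409),
    (24, 0.492889, 0.499886),
]


def implied_sd(p):
    """Standard deviation of a zero-one variable whose mean is p."""
    return math.sqrt(p * (1.0 - p))


for number, mean, sd in rows:
    # Print the value implied by the mean next to the tabulated value.
    print(number, round(implied_sd(mean), 6), sd)
```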

(C) Variable number one was the incident code reported. When the
incident codes (Appendix C) were studied, it was concluded that there
was a sufficient ordering underlying the values of this variable, from
more violent to less violent with increasing magnitude of the variable,
for comparisons to be made. This ordering was very fundamental and
very dependent on the interpretations of the definitions of the
incident codes since the definitions are not unambiguous. The numerical
gaps in the code and the predominance of nonviolent codes, which
seemed to have a lower degree of ordering than the violent codes,
forced the conclusion that no useful meaning could be assigned to the
absolute numerical value of the mean for any geographical area or time
span. However, between different areas or time spans, the difference
in the means would provide, in a very general sense, a relative measure
of the degree of violence associated with the insurgency activity.
The significance of such a measure was considered to be derived from
degree of violence as an indicator of phase progression in the
generally accepted formula or model of a Communist-type insurgency or
war of liberation.

(C) Variables two through seven constituted the reporter of the
incident. Since the six variables constitute a partition of the set of
incident reporters, it was expected that the mean value of a variable
would indicate the proportion of the total incidents reported by that
variable. The sum of these means was equal to .989647, which indicated
that the identity of the originator of the incident report was recorded
on the report and transferred to the data set for about 99 per cent of
the reports in the data base. Variables eight through twelve constitute
a partition of the set of incident initiators. The sum of these
means was .268095, with the mean of the unknown initiator variable
(Variable 12) equal to .004628. If it had been the case that the sum
of the means was nearly equal to one, the means would have been useful
as indicators of the proportions of activity initiated by government
forces and insurgent forces. No explanation was found for the high
proportion of incident files with no information recorded in this
category, and, with only about 27 per cent information, the application
of proportions to the total recorded was not considered to be a valid
technique for approximating the actual proportions which would have
been available without missing data. The same situation was observed
in the consideration of variables thirteen through seventeen, the
target of the incident category, where the sum of the means was equal
to .247593, with the mean of variable seventeen (questionable target)
equal to .001117. It was noted, however, that all the variables in this
category tended to reflect incidents which would be initiated by
insurgents and not government forces. The largest proportion of the
recorded information concerning the initiator of incidents corresponded
to Communist Terrorists, but the next largest proportion corresponded
to the Royal Thai Army initiated incidents. It was conjectured that
the fact this category had the lowest sum of means might stem from a
feeling among report originators that the category was not exhaustive
and no entry at all might be better than an erroneous entry.
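
The three sums quoted above can be reproduced directly from the Table I
means. The short sketch below does so; the list names are illustrative
only. Because each category is a partition, the sum of its means is the
fraction of incident files with an entry recorded in that category.

```python
# Means copied from Table I, grouped by category.
reporter = [0.171372, 0.274179, 0.070362, 0.108319, 0.147906, 0.217509]  # Variables 2-7
initiator = [0.048523, 0.013565, 0.028411, 0.172968, 0.004628]           # Variables 8-12
target = [0.078781, 0.076390, 0.038309, 0.052996, 0.001117]              # Variables 13-17

for name, means in (("reporter", reporter),
                    ("initiator", initiator),
                    ("target", target)):
    # Fraction of files with an entry in this category.
    print(name, round(sum(means), 6))
```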

(C) Considering the completely ordered variables, day of month
(Variable 18), month (Variable 19), and hour of day (Variable 27),
the means and standard deviations were both evaluated. In the case of
the day of month and month variables, the observed means and standard
deviations correspond closely to the expected means and standard
deviations of random variables with a uniform distribution over the
appropriate intervals, as shown by:

                   Observed              Expected (U(1, 30) or U(1, 12))

                          Standard                   Standard
              Mean        Deviation       Mean       Deviation

   Day      14.444157      9.418326       15.5         8.09
   Month     5.786589      3.274970        6.5         3.18

The mean of the hour of day is less than the expected mean by a factor
of 2/3, and the standard deviation less by a factor of 1/2. The
indication was that the preponderance of incidents took place in the
morning hours rather than being uniformly distributed throughout the
day or the average daylight hours.
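
The expected values in the comparison above can be approximated with the
standard moments of a continuous uniform variable on [a, b]: mean
(a + b)/2 and standard deviation (b - a)/sqrt(12). The sketch below
assumes that continuous form, since the text does not say how the expected
figures were computed. Under this assumption the month figures and the day
mean agree with the listed 6.5, 3.18, and 15.5, while the day standard
deviation comes out near 8.37 rather than the listed 8.09, so the listed
value may rest on a slightly different interval.

```python
import math


def uniform_moments(a, b):
    """Mean and standard deviation of a continuous uniform variable on [a, b]."""
    return (a + b) / 2.0, (b - a) / math.sqrt(12.0)


for label, (a, b) in (("day", (1, 30)), ("month", (1, 12))):
    mean, sd = uniform_moments(a, b)
    print(label, round(mean, 2), round(sd, 2))
```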

(C) With respect to the people involved (Variables 21 through 26),
it was noted that while the average incident involved 15 Communist
Terrorists, the standard deviation of this variable was almost three
times as large as the mean. This would tend to reduce the value of the
mean of this variable as a comparative indicator. It was also observed
that approximately one half of all incident reports included a positive
response to Variable 24 (other ethnic groups). Depending on the
judgement of operational usefulness of ethnic information, it could be
worthwhile to expand the ethnic variable categories, or to generate a
numerical code for the groups to be included in this variable.

III. FACTOR ANALYSIS (U)

(U) An objective in the analysis of a large set of data which has
the potential of producing significant or useful results is a
reduction in the size of the data set. If this can be done without
destroying the useful value of the information contained in the data, 
the result is a smaller set of data to manipulate, thus simplifying 
additional analysis efforts. It is the objective of factor analysis 
to represent a set of variables in terms of a smaller set of factors 
which can be constructed from the data. 

(U) The simplest model for representing one variable in terms of 
other variables is the linear model which is used in most factor 
analyses. Within the linear framework there are two distinct 
approaches: to extract the maximum variance from the data, or to
maximally reproduce the correlations among the variables. Harmon [2] and
Morrison [3] were considered to be the better references for the 
material which follows. For consistency, the notation provided by 
Harmon was used wherever possible. 

A. PRINCIPAL COMPONENTS ANALYSIS (U) 



(U) The method of principal components empirically reduces a large
set of data so that n original variables are describable in terms of
n uncorrelated components $F_1, F_2, \ldots, F_n$. If $x_j$ is an
original variable, the model is

(3-1)    $x_j = a_{j1}F_1 + a_{j2}F_2 + \cdots + a_{jn}F_n \qquad (j = 1, 2, \ldots, n)$.

All n components are required to reproduce the correlations among the
n original variables, but each component, in turn, makes a maximum
contribution to the sum of the variances of the original variables.
That is, the first component is the one which contributes a maximum to
the sum of the variances of the original variables, the second
contributes a maximum to the remaining variance, and so on; with the
sum of the variances of the n principal components being equal to the
sum of the variances of the n original variables. Consequently,
depending upon the amount of total variance which it is desired to
retain in the reduced data set, only a small number of the components
will be required to represent the original data in a practical
application of the method.
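
As a small worked case of model (3-1), consider two standardized
variables with correlation r. This two-variable setup and the value
r = 0.6 are illustrative assumptions and are not taken from the VIST
data. The roots of the correlation matrix sum to the number of
variables, and the first component alone carries the fraction (1 + r)/2
of the total variance.

```latex
% Two standardized variables x_1, x_2 with correlation r, 0 < r < 1 (illustrative).
\[
R = \begin{pmatrix} 1 & r \\ r & 1 \end{pmatrix}, \qquad
\lambda_1 = 1 + r, \quad \lambda_2 = 1 - r, \qquad
\lambda_1 + \lambda_2 = 2 .
\]
% Coefficients of model (3-1), with unit-variance components F_1 and F_2:
\[
x_1 = \sqrt{\tfrac{1+r}{2}}\,F_1 + \sqrt{\tfrac{1-r}{2}}\,F_2, \qquad
x_2 = \sqrt{\tfrac{1+r}{2}}\,F_1 - \sqrt{\tfrac{1-r}{2}}\,F_2 .
\]
% Share of the total variance carried by the first component:
\[
\frac{\lambda_1}{\lambda_1 + \lambda_2} = \frac{1+r}{2},
\qquad \text{e.g. } r = 0.6 \;\Rightarrow\; 0.8 .
\]
```

The squared coefficients in each row sum to one, the variance of a
standardized variable, and the products of corresponding coefficients
sum to r, so both components together reproduce the correlation exactly.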

1. Geometric Interpretation (U) 

(U) The principal components solution can be described geometrically
by considering a scatter of p observed values of the n variables
in n-space. Harmon [2] proves that for the principal component solution,
where the rank of the correlation matrix is n, the common-factor
space for these observations is n-dimensional. The swarms of points will
be generally concentric, similar, and similarly situated ellipsoids;
exactly so for a normally distributed population [7]. If the major axis
of the ellipsoid is defined as the axis passing through the direction of
maximum variance of the points, Morrison [3] shows this major axis
corresponds to the first principal component when the axis is rotated to
maximize variance. The maximizing solution would be the characteristic
vector of the greatest root (eigenvalue) of the sample correlation
matrix. The remaining roots and characteristic vectors of the correlation
matrix would then determine the orientation of the higher ordered
principal component axes. The geometric interpretation also implicitly
contains the minimizing of the sums of squared distances from each point
to its projection on each of the successive n coordinate axes.
Minimization of the sums of squared distances is equivalent to the
maximization of variances, and an orthogonal least-squares solution to
the best fitting line to the swarm of points would yield the first
principal component axis directly. Successive orthogonal least-squares
solutions would yield the remaining n-1 axes.

2. Limitations of Principal Components (U)

(U) Principal components was described above as being highly
dependent on the sums of the variances of the original variables. If
these variables are not measured in the same units, changing the scale
or performing other linear transformations of the variables could so
change the shapes of the ellipsoids that the axes (principal
components) would have no consistent meaning and the results of the
analysis would not be useful in any practical sense.
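
The scale dependence can be seen in a two-variable sketch. For a 2 x 2
covariance matrix the direction of the first principal axis has the
closed form 0.5 * atan2(2 * cov, var_x - var_y). The numerical values
below are illustrative assumptions, not VIST data.

```python
import math


def first_axis_angle(var_x, var_y, cov_xy):
    """Angle in degrees of the first principal axis of a 2x2 covariance matrix."""
    return math.degrees(0.5 * math.atan2(2.0 * cov_xy, var_x - var_y))


# Two variables with unit variances and covariance 0.5 (illustrative values).
print(round(first_axis_angle(1.0, 1.0, 0.5), 1))

# The same data with the second variable multiplied by 10 (a change of units):
# its variance is multiplied by 100 and the covariance by 10.
print(round(first_axis_angle(1.0, 100.0, 5.0), 1))
```

In the first case the axis weights both variables equally; after the change
of units it lies almost entirely along the rescaled variable, although the
underlying relationship between the variables is unchanged. Working from the
correlation matrix, as in the analyses reported here, removes this particular
dependence on units.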

B. CLASSICAL FACTOR ANALYSIS (U) 



(U) Classical factor analysis (which will be referred to as factor
analysis in the following discussion to distinguish the classical factor
analysis model from the principal components analysis model) generally
uses the second approach to factor analysis in that the model is
basically designed to maximally reproduce the correlations among the
original variables. The model describes each of the original n
variables ($x_i$, i = 1, 2, ..., n) in terms of m common factors and a
unique factor by

(3-2)    $x_i = a_{i1}F_1 + a_{i2}F_2 + \cdots + a_{im}F_m + d_iU_i$.

The number of common factors, m, is usually much smaller than n, and
these m factors account for the correlations among the original
variables. The unique factor accounts for error and the remaining
variance of the variable. In this model the F's are considered random
variables and the correlations among the variables of a given sample
are treated as if they were the population correlations; that is,
statistical variations are ignored and no assumptions are made, in the
general case, about the statistical distributions of the original
variables. Since the means and variances of the F's and U's are unknown
in practice, they can be assumed to be zero and unity respectively with
no loss of generality in the model; additionally, the n unique factors
are supposed to be mutually independent and independent of the m
common factors. The basic problem of factor analysis is to estimate the
nm loadings of the common factors [2, Chap. 2].

1. Principal Factor Solution (U) 

(U) Harmon states that the principal factor solution is
probably the most widely used technique in factor analysis. The
technique is adapted from the principal components solution to the factor
analysis model (3-2), and is distinct from principal components in the
amount of variance analysed. The communalities of the variables are
placed on the diagonal of the correlation matrix to obtain a reduced
matrix for principal factor analysis. Whereas, in the principal
components model, the principal components can be expressed in terms of
the observed variables, approximating procedures must be used in
determining the principal factors.

(U) The sum of squares of factor coefficients gives the communality
of a specified variable, where the square of one coefficient represents
the contribution of that factor to the communality of the variable. The
principal factor method selects successive ordered-factor coefficients
so as to make the sum of the contributions of the ordered factors to the
total communality a maximum. For the first factor this sum is given by

(3-3)    $V_1 = a_{11}^2 + a_{21}^2 + \cdots + a_{n1}^2$

with each $a_{j1}$ chosen to maximize $V_1$ under the constraint

(3-4)    $r_{jk} = \sum_{p=1}^{m} a_{jp} a_{kp} \qquad (j, k = 1, 2, \ldots, n)$,

where $r_{jj}$ is the communality of variable j and $r_{jk} = r_{kj}$. The
constraint conditions are the replacement of the observed correlations by
the reproduced correlations, which implies the assumption of zero
residuals.

The solution technique for this constrained maximization problem
proposed by Harmon [2] is the application of Lagrange multipliers,
$\mu_{jk}$ (with $\mu_{jk} = \mu_{kj}$), which defines a new function T,
such that

(3-5)    $2T = V_1 - \sum_{j,k=1}^{n} \mu_{jk} r_{jk} = V_1 - \sum_{j,k=1}^{n} \sum_{p=1}^{m} \mu_{jk} a_{jp} a_{kp}$.

The partial derivatives of the new function T with respect to any one
of the n variables $a_{j1}$ are set equal to zero, giving

(3-6)    $\frac{\partial T}{\partial a_{j1}} = a_{j1} - \sum_{k=1}^{n} \mu_{jk} a_{k1} = 0$,

while the partial derivatives with respect to any of the other
coefficients, $a_{jp}$ ($p \neq 1$), are set equal to zero, giving

(3-7)    $\frac{\partial T}{\partial a_{jp}} = -\sum_{k=1}^{n} \mu_{jk} a_{kp} = 0, \qquad p \neq 1$.

These sets of equations are combined as follows:

(3-8)    $\frac{\partial T}{\partial a_{jp}} = \delta_{1p} a_{j1} - \sum_{k=1}^{n} \mu_{jk} a_{kp} = 0, \qquad p = 1, 2, \ldots, m$,

where the Kronecker delta $\delta_{1p} = 1$ if p = 1, and $\delta_{1p} = 0$
if $p \neq 1$. If the combined equations are multiplied by $a_{j1}$ and the
summing operation completed with respect to j, this gives

(3-9)    $\delta_{1p} \sum_{j=1}^{n} a_{j1}^2 - \sum_{j=1}^{n} \sum_{k=1}^{n} \mu_{jk} a_{j1} a_{kp} = 0$.

From equation (3-6), $\sum_{j=1}^{n} \mu_{jk} a_{j1} = a_{k1}$, and if
$\sum_{j=1}^{n} a_{j1}^2 = \lambda_1$, then equation (3-9) can be written as

(3-10)   $\delta_{1p} \lambda_1 - \sum_{k=1}^{n} a_{k1} a_{kp} = 0$.

If this equation is multiplied by $a_{jp}$ and the summing operation
completed with respect to p, the resulting equation is

(3-11)   $a_{j1} \lambda_1 - \sum_{k=1}^{n} a_{k1} \sum_{p=1}^{m} a_{jp} a_{kp} = 0$.



When the constraining conditions in equation (3-4) are applied to this
equation, the result is

(3-12)   $\sum_{k=1}^{n} r_{jk} a_{k1} - \lambda_1 a_{j1} = 0$,

which is a set of n equations, one equation for each value of j, the
first of which can be written, in terms of communality ($r_{11} = h_1^2$), as

(3-13)   $(h_1^2 - \lambda) a_{11} + r_{12} a_{21} + \cdots + r_{1n} a_{n1} = 0$.

Hadley [6] has shown that a necessary and sufficient condition for this
type of set of equations to have a nontrivial solution is the vanishing
of the determinant of the coefficients of the unknowns, $a_{j1}$. The
expansion of the determinant is an n-th order polynomial in $\lambda$.
Setting this polynomial equal to zero gives the characteristic equation
of the set of equations. For a given root, the set of equations has a
family of solutions, all of which are proportional to one solution,
where the factor of proportionality is fixed by

(3-14)   $\lambda_1 = \sum_{j=1}^{n} a_{j1}^2$.

However, this is the equation for $V_1$ (equation (3-3)), the quantity to
be maximized; that is, the maximum value of $V_1$ is the largest root,
or eigenvalue, of the characteristic equation. To find the coefficients
of the first factor, $F_1$, which will account for the maximum amount of
communality possible, the value of $\lambda_1$ is substituted into the
set of equations of type (3-13) and a solution vector, or eigenvector,
$(\alpha_{11}, \alpha_{21}, \ldots, \alpha_{n1})$ is obtained. To satisfy
the conditions of equation (3-3),

(3-15)   $a_{j1} = \frac{\alpha_{j1} \sqrt{\lambda_1}}{\sqrt{\alpha_{11}^2 + \alpha_{21}^2 + \cdots + \alpha_{n1}^2}} \qquad (j = 1, 2, \ldots, n)$

gives the coefficients for the first factor, $F_1$.

The resulting problem is then to find the coefficients for the factor,
$F_2$, which will account for a maximum amount of the residual
communality. The residual correlations after the first factor has been
determined are given by

(3-16)   ${}_{1}r_{jk} = r_{jk} - a_{j1} a_{k1} = a_{j2} a_{k2} + a_{j3} a_{k3} + \cdots + a_{jm} a_{km}$

and the quantity to be maximized is

(3-17)   $V_2 = a_{12}^2 + a_{22}^2 + \cdots + a_{n2}^2$,

subject to the constraint conditions in equation (3-16). Iteration of
the Lagrange multiplier technique yields $\lambda_2$, the second largest
eigenvalue of the solution to the equations of type (3-13), as the
maximizing value of $V_2$. The factor coefficients are then determined in
the same manner, using equation (3-15), as was used for the first factor.
The procedure is applied simultaneously in practice to obtain all factor
coefficients, each corresponding to successively decreasing values of
the eigenvalues.
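
The successive extraction described above can be sketched in a few lines.
The example below uses power iteration to find the largest root and its
eigenvector, scales the eigenvector as in equation (3-15), and removes the
factor as in equation (3-16). Power iteration is a stand-in chosen for
brevity and is not the procedure of the BIMED programs; it assumes the
largest root is positive and dominant. The function names and the loadings
used to build the small reduced correlation matrix are made up for the
illustration.

```python
import math


def largest_eigenpair(matrix, iterations=500):
    """Power iteration on a symmetric matrix: (largest eigenvalue, unit eigenvector)."""
    n = len(matrix)
    vector = [1.0 / math.sqrt(n)] * n
    for _ in range(iterations):
        product = [sum(matrix[j][k] * vector[k] for k in range(n)) for j in range(n)]
        norm = math.sqrt(sum(x * x for x in product))
        vector = [x / norm for x in product]
    # Rayleigh quotient of the converged unit vector gives the eigenvalue.
    product = [sum(matrix[j][k] * vector[k] for k in range(n)) for j in range(n)]
    value = sum(v * p for v, p in zip(vector, product))
    return value, vector


def principal_factors(reduced, m):
    """Extract m factors in turn; returns one list of n coefficients per factor."""
    n = len(reduced)
    residual = [row[:] for row in reduced]
    factors = []
    for _ in range(m):
        lam, alpha = largest_eigenpair(residual)
        # Equation (3-15); alpha has unit length, so the denominator is 1.
        column = [a * math.sqrt(lam) for a in alpha]
        factors.append(column)
        # Equation (3-16): residual correlations after removing this factor.
        residual = [[residual[j][k] - column[j] * column[k] for k in range(n)]
                    for j in range(n)]
    return factors


# Made-up loadings for four variables on two factors, used only to build a
# reduced correlation matrix (communalities on the diagonal) of known structure.
made_up = [[0.8, 0.3], [0.7, -0.2], [0.6, 0.4], [0.5, -0.4]]
reduced = [[sum(a * b for a, b in zip(row_j, row_k)) for row_k in made_up]
           for row_j in made_up]

factors = principal_factors(reduced, 2)
for p, column in enumerate(factors, start=1):
    # V_p is the contribution of factor p to the total communality.
    print("V_%d =" % p, round(sum(a * a for a in column), 4))
```

The extracted coefficients differ from the made-up loadings by a rotation,
but they reproduce the same correlations, and the two values of V sum to the
total communality on the diagonal of the reduced matrix.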

C. APPLICATION TO VIST DATA (U)

(U) The 27 variables [1] were retained from the analysis of the
initial data set of 1592 reports, and are defined in Appendix D.
Biomedical Computer Program (BIMED) 03M [4] was used for principal
components analysis and BIMED X72 [5] was used for principal factor
analysis.
The computational procedures for these programs are described in 
Appendix E and Appendix F respectively. 

(C) The two initial runs were made for comparison of the factor
patterns from the expanded data set with those obtained by Gardner using
the same factor analysis technique. The first run was for the total
data set. In geographical area coverage as well as time period coverage,
the initial data set was a subset of the expanded data base. The results
of this comparison indicated a general agreement among the factor
loadings once the factors were reordered. The order of the
11 factors from the initial data corresponded to the following order of
10 factors from the expanded data:

   1592 -  1  2  3  4  5  6  7  8  9  10  11
   6264 -  2  1  3  4  5  9  6  8  7  No

The tenth factor from the expanded data loaded very high (VH) on variable
19, month of incident; there was no corresponding loading with the initial
data. The initial data factor 10 loaded on Vietnamese/Pathet Lao, which
loaded high negative (H-) in the expanded data factor 5. The very high
(VH) loading for Non-Communists for factor 11 from the initial data did
not appear in the factor pattern from the expanded data.

(C) These results agreed with the anticipated result of only general 
correspondence between the first few factors which was expected because 
of the divergent area and time coverage. 

(C) The second run compared 415 incident reports in Amphoe Chiang
Khong, Changwat Chiang Rai, with 312 incident reports for the same area
in the initial data. The resulting factors ordered as follows:

    312 -  1  2  3  4  5  6  7   8  9  10
    415 -  1  2  3  6  5  4  No  7  9  No

The expanded data analysis generated factors which could be interpreted
as Police/Non-Communist Terrorists and Agent Reported in the seventh and
tenth positions. These factors were not found in the initial factors.
The Date/Time and Paramilitary factors in the initial analysis were not
reproduced from the expanded data set. However, the factor patterns
obtained from the same area with an increase in time period only were in
closer agreement than the total data set factor patterns. The factor
patterns obtained from the initial data are reproduced in Appendix G.

1. Analysis of a Changwat (U) 

(C) Changwat Nakhon Phanom was selected for more detailed
applications of the factor analysis techniques to the data base. Nakhon
Phanom lies in northeast Thailand on the border with Laos. Orientation
maps of the area are shown in Appendix H. The first instances of an
active insurgency in Thailand were observed in Amphoe Na Kae, and Nakhon
Phanom has had the highest proportion of incident reports (2526/6264) of
any changwat in Thailand. Nakhon Phanom consists of ten Amphoes which
are very divergent in their size, ethnic populations, population densities,
terrain, transportation and communication networks, and level of
insurgency characteristics [8]. Incident reports ranged from 16 in the
flat Amphoe Si Songkhram in the North to 1266 in the mountainous Amphoe
Na Kae in the South.
(C) Principal factor analyses and principal components analyses
were conducted on representative sizes of data sets from this area.
The resulting factor matrices and the factor matrices for the total
data set are contained in COMPUTER OUTPUT, PART 1. The areas are
identified in Table II.

TABLE II (CONFIDENTIAL)
GEOGRAPHICAL AREAS FOR FACTOR ANALYSIS (U)

Area No.   Code   Geographical Name               Number of Incidents

   1       2006   Amphoe Ban Phaeng                        46
   2       2001   Amphoe Muang                             93
   3       2008   King Amphoe Don Tan                     140
   4       2004   Amphoe That Phanom                      196
   5       2007   Amphoe Mukdahan                         358
   6       2005   Amphoe Na Kae                          1266
   7       20     Changwat Nakhon Phanom                 2526
   8       --     North and Northeast Thailand           6264



The first six areas were disjoint and independent. Together they formed
a subset of the seventh area, and the seventh was a subset of the eighth.
Study of the factor matrices indicated that as the number of incidents
considered increased, the principal factor and principal component
factor matrices tended to become more similar, that is, to generate the
same factor loadings. This indication was examined in more detail by
measuring the differences between corresponding elements of the factor
matrices for each sample size, accounting for the bipolar nature of the
factor loading vectors where necessary. The measures of difference
considered were the average difference in the corresponding elements
and the average sum of squared differences in the corresponding elements.
The result of this analysis is shown in Table III.
TABLE III (UNCLASSIFIED)
CORRESPONDING ELEMENT COMPARISONS OF FACTOR MATRICES (U)

Sample Size   Average Difference   Sum of Squared Differences

     46           0.0153366               0.0000020
     93           0.0442839               0.0007640
    140           0.0474198               0.0008715
    196           0.0511202               0.0000864
    358           0.0511445               0.0026657
   1266           0.0380625               0.0001932
   2526           0.0206374               0.0000000
   6264           0.0028734               0.0000000



(U) Based on the results shown in Table III, it was concluded that,
given the data base considered, the two procedures tend to generate the
same factors with a sufficiently large sample size.
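
The element-by-element comparison described above can be sketched as follows. This is a minimal illustration, not the program used for the thesis: the function names and the small example matrices are hypothetical, the average difference is assumed to be taken over absolute values, and the bipolar nature of the loading vectors is handled by flipping the sign of any column that points opposite to its counterpart.

```python
import numpy as np

def align_signs(f, p):
    """Flip columns of p so each points the same way as the matching column of f."""
    signs = np.sign(np.sum(f * p, axis=0))   # sign of each column inner product
    signs[signs == 0] = 1.0                  # leave orthogonal columns unchanged
    return p * signs

def compare_factor_matrices(f, p):
    """Return (average absolute difference, average squared difference)."""
    f = np.asarray(f, dtype=float)
    p = align_signs(f, np.asarray(p, dtype=float))
    diff = f - p
    return float(np.mean(np.abs(diff))), float(np.mean(diff ** 2))

# Hypothetical 3-variable, 2-factor loading matrices (not thesis data).
pf = np.array([[0.80, 0.10], [0.70, -0.20], [0.10, 0.90]])
pc = pf.copy()
pc[:, 1] *= -1.0                             # same factor, opposite polarity
avg_diff, avg_sq_diff = compare_factor_matrices(pf, pc)
```

In this example the second factor differs only in polarity, so both measures are zero once the signs are aligned.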

2. Analysis of Generated Data (U) 

(U) In order to examine, in a more general sense than with the
VIST data set alone, the conclusion that principal components analysis
and principal factor analysis tend to generate the same factors from the
same data given a sufficiently large sample size, two 4000-occurrence
data sets were generated. For each occurrence, three independent standard
normal observations, $y_1, y_2, y_3$, were taken, and an arbitrary (4x3)
coefficient matrix, A, was applied to the Y vector to generate four
normal random variables, $x_1, x_2, x_3, x_4$, which were not independent.
In matrix terms:

(3-18) $X = AY^{T}$

Using the same A matrix and Y vector, four dichotomous, dependent, and
discrete random variables, $x_1, x_2, x_3, x_4$, were generated by:

(3-19) $x_i = 1$ if $y_1 a_{i1} + y_2 a_{i2} + y_3 a_{i3} > a_{i1}\mu_{y_1} + a_{i2}\mu_{y_2} + a_{i3}\mu_{y_3}$

       $x_i = 0$ otherwise,

       $i = 1, 2, 3, 4$
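
The data generation in equations (3-18) and (3-19) can be sketched as below. The coefficient matrix shown is purely hypothetical (the thesis states only that A was arbitrary), and the seed and variable names are assumptions made for the illustration.

```python
import numpy as np

rng = np.random.default_rng(0)        # fixed seed, for repeatability only
n_obs = 4000

# Hypothetical 4x3 coefficient matrix; the thesis does not list its values.
A = np.array([[1.0, 0.5, 0.2],
              [0.3, 1.0, 0.4],
              [0.6, 0.1, 1.0],
              [0.2, 0.7, 0.5]])

Y = rng.standard_normal((n_obs, 3))   # each row is one occurrence (y1, y2, y3)
X_normal = Y @ A.T                    # equation (3-18), applied row by row

mu_y = np.zeros(3)                    # means of the standard normal y's
threshold = A @ mu_y                  # right-hand side of (3-19); zero here
X_discrete = (X_normal > threshold).astype(int)   # dichotomous 0/1 variables
```

Because the y's are standard normal, the threshold in (3-19) reduces to zero, so each discrete variable simply records whether the corresponding normal variable is positive.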

The following notation was applied.

Let: $(f_{n,ij})$ be a principal factor (n by m) factor matrix from
         normally distributed data,

     $(f_{d,ij})$ be a principal factor factor matrix from discrete data,

     $(p_{n,ij})$ be a principal component factor matrix from normally
         distributed data, and

     $(p_{d,ij})$ be a principal component factor matrix from discrete data.

The measures of difference between factor matrices were defined by

(3-20) $M_n = \frac{1}{mn} \sum_{i=1}^{n} \sum_{j=1}^{m} (f_{n,ij} - p_{n,ij})^2$

for normally distributed data, and

(3-21) $M_d = \frac{1}{mn} \sum_{i=1}^{n} \sum_{j=1}^{m} (f_{d,ij} - p_{d,ij})^2$

for discrete data. The results shown in Table IV were obtained.
TABLE IV (UNCLASSIFIED)

COMPARISON OF PRINCIPAL FACTOR AND PRINCIPAL COMPONENT
FACTOR ANALYSIS USING NORMALLY DISTRIBUTED AND DISCRETE DATA (U)

Sample Size   Mn x 10^[illegible]   Md x 10^[illegible]

     30              2.567                13.595
     60             35.281                17.764
    100             20.828                15.467
    200             18.294                 6.454
    600             21.886                 4.067
   1000             23.732                 4.346
   2000             21.462                 3.332
   4000              0.000                 0.000



Based on the results, and qualified by the limited amount of testing
conducted, it was concluded that the analysis with generated data
tended to support the results of the analysis of the VIST data,
regardless of the underlying distribution of the data.
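
A comparison of this kind can be sketched as follows. This is an illustration under stated assumptions, not the thesis program: the coefficient matrix, the choice of two retained factors, the initial communality estimate (largest absolute off-diagonal correlation in each row), and the fixed iteration count are all hypothetical. The principal component loadings come from the correlation matrix itself; the principal factor loadings come from the same matrix with communalities substituted on the diagonal and re-estimated on each pass.

```python
import numpy as np

def loadings_from(mat, m):
    """First m eigenvectors of a symmetric matrix, scaled by sqrt(eigenvalue)."""
    vals, vecs = np.linalg.eigh(mat)
    order = np.argsort(vals)[::-1]                 # largest eigenvalue first
    vals, vecs = vals[order], vecs[:, order]
    return vecs[:, :m] * np.sqrt(np.clip(vals[:m], 0.0, None))

def pc_loadings(R, m):
    """Principal component loadings: unit diagonal retained."""
    return loadings_from(R, m)

def pf_loadings(R, m, n_iter=25):
    """Principal factor loadings with iterated communalities (assumed scheme)."""
    h2 = np.max(np.abs(R - np.eye(len(R))), axis=1)   # initial communalities
    for _ in range(n_iter):
        reduced = R.copy()
        np.fill_diagonal(reduced, h2)              # communalities replace the 1's
        L = loadings_from(reduced, m)
        h2 = np.clip(np.sum(L ** 2, axis=1), 0.0, 1.0)
    return L

def mean_squared_difference(F, P):
    """Measure of the form (3-20)/(3-21), after aligning column polarity."""
    signs = np.sign(np.sum(F * P, axis=0))
    signs[signs == 0] = 1.0
    return float(np.mean((F - P * signs) ** 2))

rng = np.random.default_rng(1)
A = np.array([[1.0, 0.5, 0.2], [0.3, 1.0, 0.4],
              [0.6, 0.1, 1.0], [0.2, 0.7, 0.5]])   # hypothetical coefficients
X = rng.standard_normal((4000, 3)) @ A.T

results = {}
for n in (30, 200, 4000):                          # nested samples of increasing size
    R = np.corrcoef(X[:n], rowvar=False)
    results[n] = mean_squared_difference(pf_loadings(R, 2), pc_loadings(R, 2))
```

The dictionary `results` then holds one difference measure per sample size, which can be tabulated in the manner of Table IV. The values obtained depend on the assumed matrix and seed and are not expected to reproduce the thesis figures.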

(U) Further analysis was undertaken to examine the effects of
increasing sample size, considering only the case of normally distributed
data. It was observed that the computed correlation matrices,
eigenvalues, and the cumulative proportion of total variance accounted
for by each factor were identical (within the computational limitations
of the computer) regardless of which factor analysis technique was
applied, for a given sample size. This tended to confirm the conclusion
based on the theoretical presentation of the techniques, that the
differences in the solutions were a result of the communalities computed
by the principal factor method. It was also observed that the individual
elements of the correlation matrices, the eigenvalues, and the
cumulative proportions of total variances each tended to converge
toward a specific, but unknown, value with increasing sample size, as
would be expected from the Law of Large Numbers. It was then noted that
in the principal factor solution, the change in the value of a given
factor loading for a variable was in the same direction, increasing or
decreasing, as the change in the communality for that variable; and
the magnitude of the change was generally proportional to the magnitude
of the change in the communality. This led to an examination of the
comparative stability of the factor loadings with respect to sample
size under the principal factor and principal components solutions.

The changes in communality are shown in Table V, changes in factor
loadings for the principal factor solution factor one in Table VI, and
the changes in the factor loadings for the principal components solution
factor one in Table VII.



TABLE V (UNCLASSIFIED)
COMMUNALITIES VS INCREASING SAMPLE SIZE (U)

Sample Size   Variable 1   Variable 2   Variable 3   Variable 4

     30        .789517      .929347      .962307      .972257
     60        .744993      .876937      .908039      .917428
    100        .762194      .897185      .929005      .938611
    200        .762222      .897218      .929039      .938645
    600        .758955      .893372      .925057      .934622
   1000        .758638      .892999      .924670      .934231
   2000        .760840      .895591      .927355      .936943
   4000        .624458      .932500      .975567      .985781



TABLE VI (UNCLASSIFIED)
CHANGES IN FACTOR LOADINGS VS SAMPLE SIZE, PRINCIPAL FACTOR (U)

Sample Size   Variable 1   Variable 2   Variable 3   Variable 4

  30-60        -.02542      -.02578      -.02806      -.02821
  60-100        .00991       .01075       .01094       .01100
 100-200        .00001       .00002       .00002       .00002
 200-600       -.00187      -.00204      -.00207      -.00108
 600-1000      -.00018      -.00019      -.00020      -.00020
1000-2000       .00126       .00237       .00139       .00140
2000-4000      -.08203       .01930       .02472       .02491
TABLE VII (UNCLASSIFIED)
CHANGES IN FACTOR LOADINGS VS SAMPLE SIZE, PRINCIPAL COMPONENTS (U)

Sample Size   Variable 1   Variable 2   Variable 3   Variable 4

  30-60        -.10249      -.02048      -.00219       .00176
  60-100        .03436       .01528      -.00009      -.00242
 100-200        .00328      -.00515       .00047       .00201
 200-600       -.00918       .00081      -.00037      -.00072
 600-1000       .00039      -.00022       .00017       .00040
1000-2000       .00496       .00067       .00000       .00018
2000-4000      -.00146       .00004      -.00006      -.00015
The indication was that the factor loadings from the principal component
solution tended to converge to some specific value in the same manner
as the correlation matrix. Combining this indication with the apparent
relationship between factor loadings and communalities from the principal
factor solution, and the apparent tendency toward similar solutions
from the two techniques with increasing sample size, the indication
was that the factor loadings from the principal factor solution tended to
converge to some specific value as a function of the convergence of
the correlation matrix and the communality, where the communality of
a variable replaced its autocorrelation prior to the final iteration
of the principal factor solution technique.

(U) There remained the question of why the two techniques, each
with the same correlation matrix, produced different factor matrices
at small sample sizes, and why the two techniques produced essentially
the same factor matrices at large sample sizes, again each starting
with the same correlation matrix. That is, what is the relationship
between sample size and communality? The theoretical indications are
that they are unrelated, but the empirical evidence supports convergence
of the solutions at large sample sizes. A satisfactory answer
to this question was not found.
IV. FORECASTING INSURGENT LEVELS OF ACTIVITY (U)

(U) The capability of completely and accurately anticipating or
predicting an opponent's course of action would be of inestimable value
in any conflict situation. While complete and accurate prediction is
an ultimate goal, it has rarely, if ever, been achieved in a military
situation. In general, the military commander must use estimates of
opposition capabilities which are based on a subjective evaluation of
the degree of truth and accuracy of all the information which has been
gathered concerning the opposition. The commander can then apply some
type of subjective probability to the alternative courses of action
within opposition capability in order to arrive at a decision for his
own best course of action. The amount of probability the commander
applies could be influenced by personal experience and military
judgement, the amount of available intelligence information, historical
information, the results of war games or staff studies, or any of many
other factors.

(U) Within the limited military intelligence experience of the
author, mathematical forecasting techniques have not been used in an
insurgency environment as one of the factors to influence the planning
of counterinsurgency operations. The VIST data provided a base for
examining some forecasting techniques and considering criteria for
their potential usefulness in counterinsurgency planning.
A. FORECASTING TECHNIQUES (U)

(U) To be applicable for tactical use in a counterinsurgency
environment, a forecasting technique should: be relatively simple in
a mathematical sense, not requiring the use of sophisticated computer
equipment or extensive hand computations; not require the maintenance
of a large file of historical data; have flexibility in adjusting the
rate of response; and, of course, be relatively accurate depending on
the command level and the specific use intended for the forecasts.

(U) The above criteria led to the selection of three algebraic
models and their associated smoothing techniques for forecasting total
insurgent level of activity in a specified geographical area. Brown
[9] provided the majority of the information for the following
discussion.

1. Constant Model (U) 

(U) This is the appropriate model where there is some reason
to accept the hypothesis that the insurgency is relatively stable
within the area and time span being considered. The number of incidents
occurring within one time unit, $x_t$, could then be represented by

(4-1) $x_t = a + e_t$

where the average value of the error term, $e_t$, is zero in all the
techniques being considered. The basic problem in the model is to
estimate the value of a.

a. Moving Averages (U)

(U) The moving averages method provides a simple, straightforward
technique for estimating a which does not require extensive
historical data. Considering the N most recent observations,

(4-2) $M_t = \frac{x_t + x_{t-1} + \cdots + x_{t-N+1}}{N}$

gives the average of these N observations and the current estimate
for a, the forecast for the number of incidents in the next time period.
When the next observation, $x_{t+1}$, is received, the forecast error is
given by

(4-3) $E = M_t - x_{t+1}$

and the new average is

(4-4) $M_{t+1} = M_t + \frac{x_{t+1} - x_{t-N+1}}{N}$

which remains the average of the N most recent observations.
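
The moving average update can be sketched as follows. This is a minimal illustration with hypothetical names and data: the class keeps the N most recent observations, forecasts the next period with their mean, and applies the recursive update by dropping the oldest observation each time a new one arrives.

```python
from collections import deque

class MovingAverage:
    """N-period moving average with a recursive update."""

    def __init__(self, initial_observations):
        self.window = deque(initial_observations)   # the N most recent values
        self.n = len(self.window)
        self.mean = sum(self.window) / self.n       # average of the window

    def update(self, x_next):
        """Take in the next observation; return the forecast error."""
        error = self.mean - x_next                  # forecast minus observation
        oldest = self.window.popleft()              # observation leaving the window
        self.window.append(x_next)
        self.mean += (x_next - oldest) / self.n     # recursive form of the average
        return error

# Hypothetical monthly incident counts, N = 3.
ma = MovingAverage([10, 12, 14])
first_forecast = ma.mean
first_error = ma.update(18)
```

Here the forecast for the fourth month is 12, and after the observation 18 arrives the window becomes (12, 14, 18). Note that the window itself is the historical file the technique requires.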

(U) Moving averages provides an accurate estimate in that the
average minimizes the sum of squares of the differences between the N
most recent observations and the estimate of the next observation. The
major disadvantage of moving averages is the lack of flexibility in
adjusting the rate of response. The rate of response is controlled by N:
the larger the value of N, the more stable the estimates and the more
data which must be maintained on file. To have the capability of
increasing the stability of the model, desirable if the level of insurgent
activity was, in fact, constant, would require the additional
maintenance of historical data no longer being used in the forecasting
procedure. This disadvantage can be removed if $x_{t-N+1}$ is estimated
instead of being retained in a historical file. The best estimate for
$x_{t-N+1}$ is the average of all data, $M_t$, so the estimate of the next
average becomes

(4-5) $M_{t+1} = M_t + \frac{x_{t+1} - M_t}{N}$
Since this is no longer a moving average but an estimate of an average,
Brown changes the notation to S. If the estimation is repeated for
successive observations, the resultant smoothed function is

(4-6) $S_t = \alpha x_t + (1 - \alpha) S_{t-1}$

and the process is called exponential smoothing.

b. Exponential Smoothing (U)

(U) The forecast for time t+1 by the method of exponential
smoothing is the value of $S_t$ computed in equation (4-6), and it is a
linear combination of all previous observations such that the weight
given to an observation decreases geometrically with the time since
the observation was taken; the discrete equivalent to the exponential
decay process. The smoothing constant, $\alpha$ ($0 < \alpha \le 1$), determines
the proportion of weight which will be placed on the latest observation
and, therefore, the response rate of the technique. The response is
similar to moving averages with $\alpha$ considered like 1/N. The technique
achieves accuracy in that it minimizes the weighted sum of the squared
residuals.

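
Single exponential smoothing can be sketched as follows. The function names, the starting value, and the sample data are hypothetical; the recursion is the standard one in which the new smoothed value is alpha times the latest observation plus (1 - alpha) times the previous smoothed value. The second helper lists the geometrically decreasing weights the recursion implies for the most recent observations.

```python
def exponential_smoothing(observations, alpha, s0):
    """Return (one-step-ahead forecasts, final smoothed value)."""
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must satisfy 0 < alpha <= 1")
    s = s0
    forecasts = []
    for x in observations:
        forecasts.append(s)                 # forecast made before x is seen
        s = alpha * x + (1 - alpha) * s     # smoothing recursion
    return forecasts, s

def implied_weights(alpha, k):
    """Weights on the k most recent observations, newest first."""
    return [alpha * (1 - alpha) ** j for j in range(k)]

# Hypothetical data: starting value 10, then observations 10 and 20.
forecasts, final_s = exponential_smoothing([10, 20], alpha=0.5, s0=10.0)
weights = implied_weights(0.5, 3)
```

Only the single value S must be carried from one period to the next, which is the reduction in historical data the technique is chosen for.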
2. Linear Model (U) 

(U) The indication of a constant rate of change in the level
of observations can be incorporated into the forecasts by the use of
a linear model where

(4-7) $x_t = a + bt + e_t$

represents an observed value and estimates for two coefficients are
required for forecasting. The moving averages technique is extended
to provide a least squares fit to the data using the N most recent
observations. The analogous double smoothing procedure can then be
defined by a double smoothed value, $S2_t$, of the observations through
time t which is given by

(4-8) $S2_t = \alpha S_t + (1 - \alpha) S2_{t-1}$

and the coefficients are estimated by

(4-9) $a_t = 2S_t - S2_t$

and

(4-10) $b_t = \frac{\alpha}{1 - \alpha} (S_t - S2_t)$.

Applying these coefficients to the model, the forecast for the next
time period is given by

(4-11) $\hat{x}_{t+1} = 2S_t - S2_t + \frac{\alpha}{1 - \alpha} (S_t - S2_t)$.

The forecast error is given by

(4-12) $E = \hat{x}_{t+1} - x_{t+1}$

Double smoothing requires two elements of historical data and retains
the flexibility of response rate dependent on the selection of a
value for the smoothing constant. Linear trends in the level of
insurgent activity are accounted for with only a moderate increase in
computational effort.
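
Double smoothing for the linear model can be sketched as follows. The function name and the test series are hypothetical. The starting values in the example are chosen so that the smoothed values are already in their steady state for an exact straight line, in which case the one-step forecasts should reproduce the line; this is a property of the standard double smoothing equations and serves as a check on the sketch.

```python
def double_smoothing_forecasts(observations, alpha, s_init, s2_init):
    """One-step-ahead forecasts from double exponential smoothing (alpha < 1)."""
    s, s2 = s_init, s2_init
    forecasts = []
    for x in observations:
        s = alpha * x + (1 - alpha) * s         # single smoothed value
        s2 = alpha * s + (1 - alpha) * s2       # double smoothed value
        a = 2 * s - s2                          # estimate of current level
        b = alpha / (1 - alpha) * (s - s2)      # estimate of slope
        forecasts.append(a + b)                 # forecast for the next period
    return forecasts

# Hypothetical exact linear series x_t = 5 + 2t, observed for t = 1..8.
alpha, level0, slope = 0.3, 5.0, 2.0
beta = 1 - alpha
s_init = level0 - slope * beta / alpha          # steady-state S at t = 0
s2_init = level0 - 2 * slope * beta / alpha     # steady-state S2 at t = 0
series = [level0 + slope * t for t in range(1, 9)]
forecasts = double_smoothing_forecasts(series, alpha, s_init, s2_init)
```

Each forecast in this example equals the next point on the line. With real incident counts the starting values would instead come from an initial fit, and the forecasts would lag behind changes in trend to a degree set by alpha.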

3. Quadratic Model (U) 

(U) The use of higher order polynomials was demonstrated by
the quadratic model and the triple exponential smoothing technique
where an observed value is represented as

(4-13) $x_t = a + bt + \frac{1}{2} c t^2 + e_t$

which incorporates the indication of a change in the rate of increase
or decrease of incident activity. Subjectively this might be described
as an acceleration of the insurgency. Since three coefficients must be
estimated for forecasting, a triple smoothed value, $S3_t$, of the
observations is defined as

(4-14) $S3_t = \alpha S2_t + (1 - \alpha) S3_{t-1}$

and the coefficients can be estimated by

(4-15) $a_t = 3S_t - 3S2_t + S3_t$

(4-16) $b_t = \frac{\alpha}{2(1 - \alpha)^2} \left( (6 - 5\alpha) S_t - 2(5 - 4\alpha) S2_t + (4 - 3\alpha) S3_t \right)$

(4-17) $c_t = \frac{\alpha^2}{(1 - \alpha)^2} (S_t - 2S2_t + S3_t)$.

Combining the coefficients into the model gives the forecasting
relationship for the next time period as

(4-18) $\hat{x}_{t+1} = \frac{(6 - 6\alpha + 2\alpha^2) S_t - (6 - 2\alpha) S2_t + 2 S3_t}{2 - 4\alpha + 2\alpha^2}$.

Three elements of historical data must be retained but, otherwise, the
technique retains the basic characteristics of the lower ordered models.
The computations are more extensive; however, the coefficients for the
smoothed values in the forecasting equation (4-18) are constant with
any specified value for the smoothing constant.
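
Triple smoothing can be sketched as follows. The function names are hypothetical. The sketch uses the standard triple smoothing coefficient estimates for a model with a one-half factor on the quadratic term, and the closed form is obtained by substituting those three estimates into a + b + c/2; the two routes should agree, which gives a check on the algebra.

```python
def update_smoothed(s, s2, s3, x, alpha):
    """Advance the single, double, and triple smoothed values by one observation."""
    s = alpha * x + (1 - alpha) * s
    s2 = alpha * s + (1 - alpha) * s2
    s3 = alpha * s2 + (1 - alpha) * s3
    return s, s2, s3

def forecast_from_coefficients(s, s2, s3, alpha):
    """Next-period forecast via the level, slope, and acceleration estimates."""
    beta = 1 - alpha
    a = 3 * s - 3 * s2 + s3
    b = alpha / (2 * beta ** 2) * (
        (6 - 5 * alpha) * s - 2 * (5 - 4 * alpha) * s2 + (4 - 3 * alpha) * s3)
    c = alpha ** 2 / beta ** 2 * (s - 2 * s2 + s3)
    return a + b + 0.5 * c

def forecast_closed_form(s, s2, s3, alpha):
    """Same forecast with the coefficients combined into one expression."""
    numerator = ((6 - 6 * alpha + 2 * alpha ** 2) * s
                 - (6 - 2 * alpha) * s2 + 2 * s3)
    return numerator / (2 - 4 * alpha + 2 * alpha ** 2)

# Hypothetical smoothed values after a few observations.
state = (7.0, 7.0, 7.0)
for x in (9.0, 12.0, 16.0):
    state = update_smoothed(*state, x, alpha=0.5)
via_coefficients = forecast_from_coefficients(*state, alpha=0.5)
via_closed_form = forecast_closed_form(*state, alpha=0.5)
```

For a fixed smoothing constant the three multipliers in the closed form are constants, so they can be computed once and reused for every period. They sum to one, so a perfectly constant series is forecast exactly.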

B. APPLICATION TO VIST DATA (U) 

(C) Smoothing techniques for the constant, linear, and quadratic
models were applied to Changwat Nakhon Phanom and five Amphoes in
Nakhon Phanom. The areas were selected based on the number of incidents
reported in the data base and the apparent degree of stability of the
insurgency in the area in order to attempt a representative sample for
comparative purposes. The areas are defined in Table VIII.

TABLE VIII (CONFIDENTIAL)
IDENTIFICATION OF GEOGRAPHIC AREAS (U)

Area     Geographical               Number of    Number of
Number   Area                       Incidents    Violent Incidents

  1      Amphoe Ban Phaeng              46              3
  2      Amphoe Muang                   93              7
  3      Amphoe Kham Cha I             200             23
  4      Amphoe Mukdahan               355             78
  5      Amphoe Na Kae                1266            287
  6      Changwat Nakhon Phanom       2551            512



(C) The first nine months (January - September 1967) of total
incident data were taken to establish the initial values, $S_t$, $S2_t$, and
$S3_t$, for the smoothing techniques. The initial values were computed by
least squares (moving averages) with N = 3 months. A constant lead
time of one month was used throughout the computations and smoothing
constants of .10, .30, .50, .70, and .90 were applied to each technique
for each area. The FORTRAN IV program for the forecasting is
listed in the COMPUTER PROGRAM section.

1. Results (U) 

(C) The results of the application of forecasting techniques 
are in COMPUTER OUTPUT, PART 2. Five criteria were considered as 
having the potential for being useful indicators of the accuracy of 
forecasting over time, depending on the projected use of the forecasts. 
These were: 



a. The average forecasting error, defined by

$\frac{1}{24} \sum_{i=10}^{33} (\hat{x}_i - x_i)$

b. The average percentage forecasting error, defined by

$\frac{100}{24} \sum_{i=10}^{33} \frac{\hat{x}_i - x_i}{x_i}$, where $x_i \neq 0$.

c. The average absolute value of the forecasting error per
incident, defined by

$\sum_{i=10}^{33} |\hat{x}_i - x_i| \Big/ \sum_{i=10}^{33} x_i$

d. The average sum of squared forecasting errors, defined by

$\frac{1}{24} \sum_{i=10}^{33} (\hat{x}_i - x_i)^2$

e. The value of the maximum forecast error, defined by

$\max[(\hat{x}_{10} - x_{10}), (\hat{x}_{11} - x_{11}), \ldots, (\hat{x}_{33} - x_{33})]$.

No attempt was made to apply these criteria to longer range forecasts
than one month, nor to evaluate if the techniques were accurate in any
absolute sense. One of the criteria for relative accuracy was considered,
the average sum of squared forecasting errors, and observations
were made as shown in Table IX concerning accuracy and normalized
absolute forecasting error per incident compared to variations in the
value of the smoothing constant.
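
The five criteria can be sketched in one function as follows. The function name, the dictionary keys, and the sample data are hypothetical. Several details are assumptions made for the illustration: the error is taken as forecast minus observation, months with a zero observation are skipped in the percentage error while the divisor remains the total number of months, the per-incident error divides total absolute error by total incidents, and the maximum error is reported as the signed error of largest magnitude.

```python
def forecast_criteria(forecasts, observations):
    """Summary accuracy measures for paired forecasts and observations."""
    errors = [f - x for f, x in zip(forecasts, observations)]
    n = len(errors)
    return {
        "mean_error": sum(errors) / n,
        "mean_pct_error": 100.0 * sum(
            e / x for e, x in zip(errors, observations) if x != 0) / n,
        "abs_error_per_incident": sum(abs(e) for e in errors) / sum(observations),
        "mean_squared_error": sum(e * e for e in errors) / n,
        "max_error": max(errors, key=abs),
    }

# Hypothetical three-month example (the thesis used 24 forecast months).
criteria = forecast_criteria([10, 20, 30], [8, 25, 30])
```

In this example the errors are 2, -5, and 0, so the mean error is -1 and the error of largest magnitude is -5. A comparison across techniques and smoothing constants would call the function once per combination and tabulate the chosen measure.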
TABLE IX
RELATIVE ACCURACY OF FORECASTING RESULTS (U)

Area     Overall                        Effect of
Number   Best Technique                 Smoothing Constant

  1      Constant Model                 Quadratic Model best
         α = .3                         at α = .1, otherwise
         |Error|/incident = .75         Constant Model best

  2      Constant Model                 Constant Model best
         α = .3                         throughout
         |Error|/incident = .86

  3      Constant Model                 Linear Model best at
         α = .3                         α = .1, otherwise
         |Error|/incident = .63         Constant Model best

  4      Constant Model                 Quadratic Model best
         α = .7                         at α = .1, otherwise
         |Error|/incident = .54         Constant Model best

  5      Linear Model                   Linear Model best at
         α = .1                         α = .1, otherwise
         |Error|/incident = .39         Constant Model best

  6      Constant Model                 Constant Model best
         α = .1                         throughout
         |Error|/incident = .24
V. CONCLUSIONS AND RECOMMENDATIONS (U)



(C) It was the purpose of this thesis to continue the analysis of
data collected during the early stages of a developing insurgency. This
was accomplished, and conclusions and recommendations were presented as
each phase of the analysis was examined. There were some conclusions,
however, which were not specific to the general survey of the data or
an analysis technique. There were weaknesses in the data; the absence
of information concerning the government forces and operations seemed
to be the greatest obstacle to using the data in a meaningful way to
assist in planning, conducting, or evaluating counterinsurgency
operations, no matter what analysis techniques were applied to the
data. The seriousness of this problem seemed apparent here; it was
impossible to judge, however, what the effect of not having this data
has been on analyses conducted in Thailand.

(C) Given that there were weaknesses in the data, there remained
the fact that it was, and may continue to be for some time, the
largest and most complete set of computer retrievable information
concerning the activities of Communist insurgents in the early stages of
an insurgency, and the likelihood of United States military forces being
involved in counterinsurgency operations in the future does not appear
to be decreasing. This in itself was sufficient justification for
analysis and recommendations which might improve the data.












(C) The limitation on time precluded deeper exploration into the
analysis techniques which were applied to the data and constrained
the number of techniques which could be attempted. The reduction of
the size of a data set without loss of significant information is a
valuable element in determining the extent to which the data will be
used. The operational use of factor scores as new incidents are
reported, plus periodic updating of the factor solution, would provide
a near real-time association between present and past insurgent
activity, given that the validity of the factors and their interpretations
were generally accepted within the using community. Forecasting
insurgent activity was considered important, and additional
analysis with respect to the models applied and the possibility of
sorting types of incidents into subsets with different characteristics
would seem to have the potential of improving the accuracy of the
predictions, and accuracy of results would ensure expanded usage. The
final conclusion, as to the value of the recommendations for changing
the VIST reporting systems which were made in this thesis, can only be
reached if the recommendations are implemented and the reporting system
reevaluated at a later date.
VIST DATA FORMAT

Columns   Element of Information

1-2       Changwat Code
3-4       Amphoe Code
5-6       Tambon Code
7-8       Muban Code
9-16      UTM Coordinates of Incident
17-18     Incident Code
19-24     Reporter of Incident
  19        Royal Thai Army (RTA)
  20        Civilian
  21        Police
  22        Paramilitary
  23        Source
  24        Unknown
          Initiator of Incident
            RTA
            Police
            Paramilitary
  28        Communist Terrorist
  29        Unknown
30-34
  30        RTA
  31        Civilians
  32        Police
  33        Paramilitary
  34        Questionable
35-36     Day of Month of Incident
37-38     Month
39-40     Year
41-48     Infiltration UTM Coordinates
49-56     UTM Coordinates From
57-64     UTM Coordinates To
65        Armed or Uniformed
66        Pathet Lao Involved
67        Tribes Involved
68        Vietnamese Involved
69        Other Ethnic Involved
70-72     Number of Communist Terrorists
73-75     Number of Non-Communist Terrorists
76-78     (Blank)
78-80     Hour of Day
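
A fixed-column record of this kind can be read with a simple slicing routine, sketched below. The field names, the function name, and the sample record are hypothetical, and only a subset of the listed elements is included. The sketch assumes an 80-column record with the 1-based, inclusive column positions given in the format listing, and it returns every field as text because the encoding of individual fields is not described here.

```python
# Column positions are 1-based and inclusive, as in the format listing.
FIELDS = {
    "changwat": (1, 2),
    "amphoe": (3, 4),
    "tambon": (5, 6),
    "muban": (7, 8),
    "utm_incident": (9, 16),
    "incident_code": (17, 18),
    "day": (35, 36),
    "month": (37, 38),
    "year": (39, 40),
    "hour": (78, 80),
}

def parse_record(card):
    """Slice one 80-column record into named text fields."""
    card = card.ljust(80)                       # pad short records with blanks
    return {name: card[start - 1:end].strip()
            for name, (start, end) in FIELDS.items()}

# Hypothetical record: Changwat 20, Amphoe 05, incident code 03, 15 June 1967.
sample = ("20050103" + "12345678" + "03").ljust(34) + "150667"
sample = sample.ljust(77) + " 14"
record = parse_record(sample)
```

Keeping the column table in one dictionary means further elements can be added without changing the parsing routine.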
GEOGRAPHICAL CODES

Code  Changwat        Code  Amphoe            Code  Tambon

      Chiang Rai      03    Chiang Khong      01    Wiang
                                              02    Khrung
                                              03    Bun Ruang
                                              04    Po
                                              05    Muang Yai
                                              06    Sathan
                                              07    Huai So

                      04    Chiang Kham       01    Yuan
                                              02    Chedi Dam
                                              03    Chiang Raeng
                                              04    Nam Waen
                                              05    Fai Kwang
                                              06    Wiang
                                              07    Sop Bong

                      05    Chiang Saen       01    Wiang
                                              02    Ban Saeng
                                              03    Pa Sak

                      07    Thoeng            01    Wiang Thoeng
                                              02    Ngao
                                              03    Ngiu
                                              04    Chiang Khian
                                              05    Ta
                                              06    Tap Tao
                                              07    Plong
                                              08    Pa Tan
                                              09    Mae Loi
                                              10    Yang Hom
                                              11    Mae Pao

                      08    Pong              01    Pong
                                              02    Khuan
                                              03    Ngim
                                              04    Chiang Moan
                                              05    Pha Chong Noi
                                              06    Yot
                                              07    Sa
                                              08    Oi

20    Nakhon Phanom   01    Muang             61    Nai Muang
                                              62    Nong Saeng
                                              04    Kurukhu
                                              05    Kham Thao
                                              06    Dong Khwang
                                              07    Kham Toei
                                              08    Tha Kho
                                              09    Na Sai
                                              10    Na Rat Khwai
                                              11    Nong Yat
                                              13    Ban Klang
                                              15    At Samat

                      02    Kham Cha I        01    Ban Song
                                              02    Kham Cha I
                                              03    Nong Sung
                                              04    Nong Sung Tai
                                              05    Nong Ian
                                              06    Ban Kho
                                              07    Ban Lao

                      03    Tha Uthen         01    Tha Uthen
                                              02    Chai Buri
                                              03    Tha Champa
                                              04    Na Khamin
                                              05    Ban Kho
                                              06    Pathai
                                              07    Phanom
                                              08    Phon Sawan
                                              09    Ram Rat
                                              10    Non Tan

                      04    That Phanom       01    That Phanom
                                              02    Don Nang Hong
                                              03    Tha Lat
                                              04    Na Thon
                                              05    Nam Kam
                                              06    Fang Daeng
                                              07    Phra Klang Thung
                                              08    Phon Thong
                                              09    Renu
                                              10    Saen Phan

                      05    Na Kae            01    Na Kae
                                              02    Kok Tum
                                              03    Kan Luang
                                              04    Dong Luang
                                              05    Na Khu
                                              06    Nong Bo
                                              07    Nong Bua
                                              08    Nong Sang
                                              09    Phra Song
                                              10    Phi Man
                                              11    Phum Kae

                      06    Ban Phaeng        01    Ban Phaeng
                                              02    Na Thom
                                              03    Nong Waeng

                      07    Mukdahan          01    Mukdahan
                                              02    Dong Yen
                                              03    Na Kok
                                              04    Na Sok
                                              05    Ban Khok
                                              06    Phung Daet
                                              07    Phon Sai
                                              08    Wan Yai
                                              09    Si Bun Ruang
                                                    Bang Sai Yai
                                              11    Kham A Huan
                                              12    Kham Pa Lai
                                              13    Nikham Kham Soi
                                              14    Na Udom

                      08    Don Tan (K.A.)    01    Don Tan
                                              02    Pho Sai
                                              03    Pa Rai
                                              04    Ban Bak
                                              05    Lao Mi

                      09    Si Songkhram      01    Si Songkhram
                                              02    Tha Bo Songkhram
                                              03    Na Dua
                                              04    Na Wa
                                              05    Ban Sieo
                                              06    Ban Uang
                                              07    Sam Phong
                                              08    Na Ngua

                      10    Pla Pak (K.A.)    01    Pla Pak
                                              02    Kut Ta Kai
                                              03    Khok Sawang
                                              04    Na Makhua
                                              05    Maha Chai
                                              06    Nong Hi

26    Nan             01    Muang             02    Kong Khwai
                                              03    Chaiya Sathan
                                              04    Du Thai
                                              05    Du Phong
                                              06    Thum Tong
                                              07    Tha Nao
                                              08    Na Sao
                                              09    Na Pang
                                              10    Nong Daeng
                                              11    Nam Kaen
                                              12    Bo
                                              13    Pha Sing
                                              14    Fai Kaeo
                                              15    Phong
                                              16    Mo Muang
                                              17    Muang Tit
                                              18    Muang Chang
                                              19    Ruang
                                              20    Suak
                                              21    Suat
                                              60    Nai Wiang

                      02    Thung Chang       01    Lae
                                              02    Ngop
                                              07    Pon

                      03    Na Noi            01    Na Noi
                                              02    Chiang Khong
                                              03    Na Thanung
                                              04    Bo Kaeo
                                              05    Muang Li
                                              06    Sisaket
                                              07    Sathan
                                              08    Santha

                      04    Pua               01    Pua
                                              02    Ngaeng
                                              03    Bo Klua Tai
                                              04    Bo Klua Nua
                                              06    Sila Phet
                                              07    Sila Laeng
                                              08    Sathan
                                              09    Uan

                      05    Tha Wang Pha      01    Rim
                                              02    Tan Chum
                                              03    Pa Kha
                                              04    Yam
                                              05    Pha To

                      06    Sa                01    Klang Wiang
                                              02    Khung
                                              03    Tan Chum
                                              04    Na Luang
                                              05    Nam Pua
                                              06    Nam Muap
                                              07
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APPENDIX C 

INCIDENT CODES AND DEFINITIONS

01 - Attack
     An action in which the CT use both gunfire and maneuver in an
     attempt to seize an objective.

02 - Firefight
     An action in which the CT use gunfire alone in an attempt to
     inflict casualties and/or property damage.

03 - Ambush
     A surprise attack, usually conducted from planned positions,
     against a moving or temporarily halted troop, unit, or convoy.

04 - Harassment
     An action in which the primary objective of the CT is to
     temporarily disrupt the activities of a unit, installation,
     village, or activity rather than to inflict serious casualties
     or damage. Examples are burning of crops, sniping at a patrol,
     harassing gunfire, booby traps, and mining incidents of a minor
     nature.

05 - Terrorism
     An action directed against civilians or military personnel not
     engaged in military duties in which the primary objective is to
     intimidate. This includes assassinations, kidnappings, and
     mining or bombing of public buildings.

06 - Sabotage
     Normally a covert action causing damage or destruction to
     essential facilities or property so that vital public services,
     communications, or transportation are put out of commission.

07 - Armed Propaganda
     An effort by armed CT groups to indoctrinate civilians in order
     to produce a favorable or neutral attitude toward the CT, or to
     implant doubt about RTG concern for or ability to protect its
     citizens.

08 - Anti-Aircraft Fire
     Gunfire from the ground directed at aircraft in flight.

09 - Clash
     An armed encounter between RTG forces and the CT, usually as the
     result of RTG patrol or sweep activities.

10 - Armed Procurement
     CT procurement of supplies by means of force.

19 - Other (Violent)
     Any type in the narrative.

20 - Organization
     Leadership, internal structure, designations.

21 - Meetings
     Meeting of CT groups for known or unknown purposes.

22 - Recruitment
     Methods, membership requirements, admission procedures,
     recruitment per se.

23 - Strength
     Of CT groups.

24 - Defections
     Of CT members.

25 - Sightings
     Of individuals or groups of CT.

26 - Training Outside Country
     Of personnel to be used in Thailand.

27 - Training in Country
     Doctrine, tactics, subversion.

28 - CT Camp/Base
     Location and/or description.

29 - Deployment
     Order of battle.

30 - Finances
     Contribution, extortion, taxation.

31 - Logistics
     Supply methods, sources, routes, material, suppliers.

32 - Communications
     Methods and routing, including couriers.

33 - Arms Cache
     Stockpiles of weapons.

34 - Arms Smuggling

36 - Infiltration
     Personnel augmentation from outside country.

37 - Espionage
     Includes CT asking about RTG forces.

38 - Counterfeiting

39 - Propaganda (non-armed)
     Leaflets, broadcasts, word of mouth, themes.

40 - Captured CT Documents

41 - Captured CT Material

42 - Arrests, Capture
     Of CT suspects.

43 - Release
     Of CT suspects.

44 - Helicopter Sighting

45 - Helicopter Landing

50 - CT Plans
     Proposed future actions of a violent nature.

99 - Other (Nonviolent)
     Include suspect CT.

APPENDIX D

VARIABLES USED IN ANALYSIS

 1  Incident Code
 2  RTA Reported
 3  Civilian Reported
 4  Police Reported
 5  Paramilitary Reported
 6  Source Reported
 7  Unknown Reporter
 8  RTA Initiated
 9  Police Initiated
10  Paramilitary Initiated
11  Communist Terrorist Initiated
12  Unknown Initiator
13  RTA Target
14  Civilian Target
15  Police Target
16  Paramilitary Target
17  Questionable Target
18  Day of Month
19  Month
20  Armed or Uniformed
21  Pathet Lao Involved
22  Tribes Involved
23  Vietnamese Involved
24  Other Ethnic Groups Involved
25  Number of Communist Terrorists
26  Number of Non-Communist Terrorists
27  Hour of Day

APPENDIX E

COMPUTATIONAL PROCEDURE, BIMED 03M

Step 1. The data are in the form $x_{ij}$ ($i = 1, 2, \ldots, n$, cases;
$j = 1, 2, \ldots, p$, variables). The means are computed by

$$\bar{x}_j = \sum_{i=1}^{n} x_{ij} / n,$$

the standard deviations by

$$s_j = \sqrt{\frac{\sum_{i=1}^{n} (x_{ij} - \bar{x}_j)^2}{n-1}},$$

and the correlation coefficients by

$$r_{ij} = \frac{\sum_{a=1}^{n} (x_{ai} - \bar{x}_i)(x_{aj} - \bar{x}_j)}{\sqrt{\sum_{a=1}^{n} (x_{ai} - \bar{x}_i)^2 \, \sum_{a=1}^{n} (x_{aj} - \bar{x}_j)^2}}.$$



Step 2. The diagonal elements of the correlation matrix are adjusted 
to unity. 
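
Steps 1 and 2 can be sketched in Python with NumPy. This is a minimal illustration of the formulas above, not the BIMED 03M code itself; the function name `correlation_matrix` and the small data array are hypothetical and are not taken from the incident files.

```python
import numpy as np

def correlation_matrix(x):
    """Means, standard deviations (n - 1 divisor) and correlations of an n x p array."""
    x = np.asarray(x, dtype=float)
    n = x.shape[0]
    means = x.sum(axis=0) / n
    dev = x - means                      # deviations x_ij - mean_j
    ss = (dev ** 2).sum(axis=0)          # sum of squared deviations for each variable
    stds = np.sqrt(ss / (n - 1))
    r = (dev.T @ dev) / np.sqrt(np.outer(ss, ss))
    np.fill_diagonal(r, 1.0)             # Step 2: diagonal adjusted to unity
    return means, stds, r

# Hypothetical data: 5 cases, 3 variables.
x = np.array([[1.0, 2.0, 0.0],
              [2.0, 1.0, 1.0],
              [3.0, 4.0, 0.0],
              [4.0, 3.0, 1.0],
              [5.0, 5.0, 1.0]])
means, stds, r = correlation_matrix(x)
```

A variable with zero variance would produce a division by zero in this sketch, so constant columns are assumed to have been removed beforehand.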

Step 3. The p eigenvalues and p eigenvectors of the adjusted
correlation matrix, R, are obtained by solving the system

$$RV = V\Lambda, \qquad V^{T}V = I,$$

where $\lambda_j$, the $j$th diagonal element of $\Lambda$, is the $j$th
eigenvalue, and $v_j = (v_{1j}, v_{2j}, \ldots, v_{pj})$ is the $j$th
eigenvector.

Step 4. The program determines the number of factors to be rotated,
q, by the formula

q = min(k, m)

where k is the number of eigenvalues greater than c (an input to the
program, c = 1.0), and m is a specified number of factors (input m = 13).

Step 5. The coefficients of each factor are obtained by

$$a_{ij} = \sqrt{\lambda_j}\, v_{ij}, \qquad i = 1, 2, \ldots, p; \quad j = 1, 2, \ldots, q,$$

and the $p \times q$ factor matrix, $A = (a_{ij})$, is printed.

Step 6. A factor check matrix is computed, given that $p \neq q$, by

$$C = A^{T}A,$$

where C is a $q \times q$ matrix containing the eigenvalues on the diagonal.
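
Steps 3 through 6 can be illustrated with the following sketch. It assumes NumPy's symmetric eigensolver `numpy.linalg.eigh` in place of whatever routine BIMED 03M used, and the function name and the 3 x 3 correlation matrix are hypothetical.

```python
import numpy as np

def principal_component_loadings(r, c=1.0, m=13):
    """Return the eigenvalues (descending) and the p x q factor matrix A."""
    eigvals, eigvecs = np.linalg.eigh(r)         # eigh returns ascending eigenvalues
    order = np.argsort(eigvals)[::-1]            # reorder to descending
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    k = int(np.sum(eigvals > c))                 # eigenvalues greater than c
    q = min(k, m)                                # Step 4
    a = eigvecs[:, :q] * np.sqrt(eigvals[:q])    # Step 5: a_ij = sqrt(lambda_j) * v_ij
    return eigvals, a

# Hypothetical correlation matrix with unit diagonal.
r = np.array([[1.0, 0.6, 0.5],
              [0.6, 1.0, 0.4],
              [0.5, 0.4, 1.0]])
eigvals, a = principal_component_loadings(r)
check = a.T @ a                                  # Step 6: factor check matrix
```

Because the eigenvectors are orthonormal, the diagonal of `check` reproduces the q retained eigenvalues and the off-diagonal elements vanish, which is the property the factor check matrix is meant to confirm.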

Step 7. Orthogonal rotations are performed on the factor matrix to
maximize

$$V = \sum_{j} \left[ p \sum_{i} \left( a_{ij}^2 / h_i^2 \right)^2 - \left( \sum_{i} a_{ij}^2 / h_i^2 \right)^2 \right],$$

where i = 1, ..., p are variables; j = 1, ..., q are factors; and
$h_i^2$ is the communality of the $i$th variable, defined by

$$h_i^2 = \sum_{j} a_{ij}^2.$$

Step 8. The normalized factor matrix is computed by

$$b_{ij} = a_{ij} / \sqrt{h_i^2}.$$



Step 9. A computational loop is started through Step 12 until the
convergence test is passed in Step 9 and the program transfers to
Step 13. The variance is computed for each factor by

$$s_j = \left[ p \sum_{i} (b_{ij}^2)^2 - \left( \sum_{i} b_{ij}^2 \right)^2 \right] / p^2,$$

and if $|V_c - V_{c-1}| < 10^{-7}$ four successive times, the program
transfers to Step 13.

Step 10. The maximization is carried out for two factors at a time.
If x and y are column vectors of normalized coefficients (factor
loadings), then, treating them as constants,

$$(x, y) \begin{bmatrix} \cos\phi & -\sin\phi \\ \sin\phi & \cos\phi \end{bmatrix} = (X, Y),$$

where X and Y, the desired normalized loadings, are functions of $\phi$,
the angle of rotation. Necessary conditions for a maximum are:

$$A = \sum_i (x_i + y_i)(x_i - y_i)$$

$$B = 2 \sum_i x_i y_i$$

$$C = \sum_i \left[ (x_i + y_i)(x_i - y_i) + 2 x_i y_i \right] \left[ (x_i + y_i)(x_i - y_i) - 2 x_i y_i \right]$$

$$D = 4 \sum_i (x_i + y_i)(x_i - y_i)\, x_i y_i$$

$$\mathrm{NUM} = D - 2AB/p$$

$$\mathrm{DEN} = C - (A + B)(A - B)/p$$

$$\phi' = \tfrac{1}{4}\, \mathrm{ATAN}(\mathrm{NUM}/\mathrm{DEN})$$

If $\phi' \le 1/4$ degree the program goes to Step 12; otherwise, to Step 11.
Step 11. If DEN is positive, the program computes:

$$|\cos\phi| = \cos\phi', \qquad |\sin\phi| = \sin\phi';$$

otherwise, it computes:

$$|\cos\phi| = (\sqrt{2}/2)(\cos\phi' + \sin\phi'), \qquad |\sin\phi| = (\sqrt{2}/2)(\cos\phi' - \sin\phi');$$

and if NUM is positive, the program computes:

$$\cos\phi = |\cos\phi|, \qquad \sin\phi = |\sin\phi|;$$

otherwise, it computes:

$$\cos\phi = |\cos\phi|, \qquad \sin\phi = -|\sin\phi|,$$

and goes to Step 12.

Step 12. One iteration cycle is completed when all pairwise
single-plane rotations (Steps 10 and 11) are made on the normalized factors;
the program then goes to Step 9. 
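
The rotation loop of Steps 7 through 12, together with the unnormalization of Step 13, can be sketched as follows. This is an illustrative reading of the procedure rather than the BIMED 03M source: the quadrant-aware `numpy.arctan2` is substituted for the explicit sign tests of Step 11, a single convergence test replaces the requirement of four successive passes, and the function names and example loadings are hypothetical.

```python
import numpy as np

def varimax_criterion(b):
    """V = sum_j [ p * sum_i b_ij^4 - (sum_i b_ij^2)^2 ] for normalized loadings b."""
    p = b.shape[0]
    b2 = b ** 2
    return float(np.sum(p * np.sum(b2 ** 2, axis=0) - np.sum(b2, axis=0) ** 2))

def varimax(a, tol=1e-7, max_cycles=100):
    """Kaiser-normalized varimax rotation by pairwise single-plane rotations."""
    a = np.asarray(a, dtype=float)
    p, q = a.shape
    h = np.sqrt(np.sum(a ** 2, axis=1))          # square roots of the communalities
    b = a / h[:, None]                           # Step 8: normalized factor matrix
    previous = varimax_criterion(b)
    for _ in range(max_cycles):                  # Step 9: start of a cycle
        for j in range(q - 1):
            for k in range(j + 1, q):            # Step 10: one pair of factors
                x, y = b[:, j].copy(), b[:, k].copy()
                u = (x + y) * (x - y)
                v = 2.0 * x * y
                A, B = u.sum(), v.sum()
                C = np.sum((u + v) * (u - v))
                D = 2.0 * np.sum(u * v)          # equals 4 * sum(u * x * y)
                num = D - 2.0 * A * B / p
                den = C - (A + B) * (A - B) / p
                phi = 0.25 * np.arctan2(num, den)  # quadrant handled here (Step 11)
                c_phi, s_phi = np.cos(phi), np.sin(phi)
                b[:, j] = x * c_phi + y * s_phi
                b[:, k] = -x * s_phi + y * c_phi
        current = varimax_criterion(b)           # Step 12: cycle complete
        if abs(current - previous) < tol:
            break
        previous = current
    return b * h[:, None]                        # Step 13: unnormalize

# Hypothetical 4 x 2 factor matrix.
a_example = np.array([[0.7, 0.5],
                      [0.6, 0.6],
                      [0.2, 0.8],
                      [0.5, 0.3]])
rotated = varimax(a_example)
```

Since each plane rotation is orthogonal, the communalities of the rotated matrix equal those of the input, which provides a simple check on any implementation. A variable with zero communality would cause a division by zero in the normalization and is assumed not to occur.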

Step 13. The final rotated normalized factors are unnormalized by

$$a_{ij} = b_{ij} \sqrt{h_i^2},$$

and the rotated unnormalized factor matrix, A, is printed. Final
communalities and the difference from initial communalities are computed
and printed, and the original and successive variances are printed.



UNCLASSIFIED 




APPENDIX F 

COMPUTATIONAL PROCEDURE, BIMED X72 

Step 1. The data are in the form $x_{ij}$ ($i = 1, 2, \ldots, n$, cases;
$j = 1, 2, \ldots, p$, variables). The data are read casewise and the
method of provisional means is used to compute cross products
recursively by

$$\bar{x}_i^{(k+1)} = \bar{x}_i^{(k)} + D_i,$$

$$S_{ij}^{(k+1)} = S_{ij}^{(k)} + k(k+1) D_i D_j,$$

where

$$D_i = \left( x_{k+1,i} - \bar{x}_i^{(k)} \right) / (k+1).$$

The means, covariances, standard deviations, and correlations are
given by

$$\bar{x}_i = \bar{x}_i^{(n)}, \qquad s_{ij} = S_{ij}^{(n)} / n, \qquad s_i = (s_{ii})^{1/2},$$

and

$$r_{ij} = s_{ij} / (s_i s_j).$$

Step 2. As required for initial communality estimates, the diagonal
of the matrix $(r_{ij})$ is left unaltered.
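
The method of provisional means in Step 1 can be sketched as a single pass over the cases. The sketch below follows the recursions as written above, including the divisor n for the covariances; the function name and data are hypothetical, and at least one case is assumed.

```python
import numpy as np

def provisional_moments(cases):
    """One pass over the cases, updating means and cross products recursively."""
    mean = None
    s = None
    k = 0                                        # number of cases processed so far
    for row in cases:
        row = np.asarray(row, dtype=float)
        if mean is None:
            mean = np.zeros_like(row)
            s = np.zeros((row.size, row.size))
        d = (row - mean) / (k + 1)               # D_i
        s += k * (k + 1) * np.outer(d, d)        # S^(k+1) = S^(k) + k(k+1) D_i D_j
        mean = mean + d                          # mean^(k+1) = mean^(k) + D_i
        k += 1
    cov = s / k                                  # s_ij = S_ij^(n) / n
    std = np.sqrt(np.diag(cov))
    corr = cov / np.outer(std, std)
    return mean, cov, std, corr

# Hypothetical data: 5 cases, 3 variables.
x = np.array([[1.0, 2.0, 0.0],
              [2.0, 1.0, 1.0],
              [3.0, 4.0, 0.0],
              [4.0, 3.0, 1.0],
              [5.0, 5.0, 1.0]])
mean, cov, std, corr = provisional_moments(x)
```

The advantage of this form is that the data need be read only once and the cross products are accumulated about the running mean, which avoids the loss of precision that can occur when raw sums of squares are subtracted.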

Step 3. The matrix $(r_{ij})$ is tridiagonalized by the method of
Householder, its eigenvalues obtained by means of Sturm sequences, and
its eigenvectors found by the method of Wilkinson. All of the
eigenvalues $\lambda_1, \lambda_2, \ldots, \lambda_p$ are obtained; the number r
of eigenvectors $v_1, \ldots, v_r$ is equal to
$\min(a, \max(i : \lambda_i > c))$, where a is the specified number of
factors (input a = 13) and c is the specified minimum eigenvalue
(input c = 1.0). The $j$th column of the factor loading matrix
$(k_{ij})$ is $\sqrt{\lambda_j}\, v_j$. New communality estimates are given by

$$h_i^2 = \sum_{j} k_{ij}^2.$$

If more than one iteration for communalities is required, the diagonal
of $(r_{ij})$ is replaced by the new estimates and this step is repeated
from the beginning. Iteration is continued until a specified maximum
number of iterations is reached or until the maximum change in the
communality estimates $h_i^2$ is less than .001. Initial and final
communality estimates are printed together with the final factor
loadings.
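
The communality iteration of Step 3 can be sketched as follows. NumPy's `numpy.linalg.eigh` is assumed here in place of the Householder, Sturm sequence, and Wilkinson routines named above, and the function name, the iteration limit of 50, and the example matrix are hypothetical.

```python
import numpy as np

def principal_factor(r, a=13, c=1.0, max_iter=50, tol=1e-3):
    """Iterated principal factor loadings and final communality estimates."""
    r = np.array(r, dtype=float)
    h2 = np.diag(r).copy()                       # Step 2: diagonal left unaltered
    for _ in range(max_iter):
        reduced = r.copy()
        np.fill_diagonal(reduced, h2)            # current communality estimates
        eigvals, eigvecs = np.linalg.eigh(reduced)
        order = np.argsort(eigvals)[::-1]        # descending eigenvalues
        eigvals, eigvecs = eigvals[order], eigvecs[:, order]
        n_factors = min(a, int(np.sum(eigvals > c)))
        loadings = eigvecs[:, :n_factors] * np.sqrt(eigvals[:n_factors])
        new_h2 = np.sum(loadings ** 2, axis=1)   # h_i^2 = sum_j k_ij^2
        change = np.max(np.abs(new_h2 - h2))
        h2 = new_h2
        if change < tol:                         # maximum change less than .001
            break
    return loadings, h2

# Hypothetical correlation matrix with unit diagonal.
r = np.array([[1.0, 0.6, 0.5],
              [0.6, 1.0, 0.4],
              [0.5, 0.4, 1.0]])
loadings, h2 = principal_factor(r)
```

With eigenvalues sorted in descending order, the count of eigenvalues exceeding c equals the largest index i for which the eigenvalue exceeds c, so the two forms of the selection rule agree.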

Step 4. Kaiser normalizations are applied to the loadings by

$$k'_{ij} = k_{ij} / h_i,$$

and rotations are performed to minimize the criterion

$$G((k_{ij})) = \sum_{a \neq b} \left( \sum_{i=1}^{p} k_{ia}^2 k_{ib}^2 - \frac{1}{p} \left( \sum_{i=1}^{p} k_{ia}^2 \right) \left( \sum_{i=1}^{p} k_{ib}^2 \right) \right),$$

where the coefficient 1/p specifies an orthogonal varimax rotation.
Rotations are made by pairwise modifications of columns of $(k_{ij})$
through all pairs. Complete cycles are continued until the change in G
relative to its initial value is less than $10^{-5}$. The Kaiser
normalizations are corrected for by the inverse relation

$$k'_{ij} = k_{ij} h_i,$$

and the final values of $k_{ij}$ are the primary factor loadings.






APPENDIX G 



INITIAL DATA FACTOR PATTERNS (U) 



Total Data Factor Names and Numbers

Variables
No.  Name

 1   Incident Code
 2   RTA Reported
 3   Civilian Reported
 4   Police Reported
 5   Paramilitary Reported
 6   Source Reported
 7   Unknown Reporter
 8   RTA Initiated
 9   Police Initiated
10   Paramilitary Initiated
11   CT Initiated
12   Unknown Initiator
13   RTA Target
14   Civilian Target
15   Police Target
16   Paramilitary Target
17   Unknown Target
18   Day
19   Month
20   Arms or Uniforms
21   Pathet Lao
22   Tribesmen
23   Vietnamese
24   Other Ethnic Groups
25   Number of CT
26   Number of Non-CT
27   Hour

MAX ABSOLUTE FORECASTING ERROR           4.0317   -5.8216    7.2252

AVERAGE % FORECASTING ERROR             64.6999  106.9222  160.8391
AVG |FORECASTING ERROR| / INCIDENT       0.7166    0.9334    1.3951
AVG SQUARED FORECASTING ERROR            3.3905    6.2225   12.4715
MAX ABSOLUTE FORECASTING ERROR           4.5953    6.3297   -8.6351

FORECASTING LEVEL OF INCIDENT ACTIVITY, AREA NUMBER 1, ALPHA LEVEL = .90

FORECASTING LEVEL OF INCIDENT ACTIVITY, AREA NUMBER 2, ALPHA LEVEL = .10

FORECASTING LEVEL OF INCIDENT ACTIVITY, AREA NUMBER 2, ALPHA LEVEL = .30